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Abstract 

We review the basic notion of correlations in point processes, adapted to the lan- 
guage of high energy physicists. The measurement of accessible information on correla- 
tions by means of correlation integrals is summarized. Applications to the measurement 
of Bose-Einstein interferometry as well as some pitfalls are discussed. 
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1 Introduction 



In moving to ever higher energies in particle physics, the experimentalist faces a rapidily 
increasing complexity of the observed events. Beyond posing questions of physics alone, 
the task to compare experimental data with competing theoretical and phenomenological 
models raises many questions purely statistical in nature. The latter define a field of their 
own: multiparticle statistics. 

The task of multiparticle statistics is to provide a framework for maximal utilization of 
the information provided by experiments. This includes questions of measurement of single 
and multiparticle spectra, correlations among two and more particles, statistical errors due 
to finite event-sample size, limited detector acceptance, misidentification of tracks and much 
more. 

In these lectures, we summarize basic concepts of point processes, correlations and their 
measurement. To demonstrate the discriminative power of advanced correlation measure- 
ments, we discuss a recent test of dynamical assumptions in modeling Bose-Einstein corre- 
lations. 

2 Cross sections, point processes and correlations 

Correlations (in the widest sense) among final state particles are commonly expressed in 
terms of exclusive or inclusive differential cross sections denoted <7 e xci an d <r incl respectively. 
To be specific, we recall some common definitions jjj: 

j N (p v . . .,p N ) 

p i is the three-momentum of the i-th particle, and a to t = J2n °n is the total (inelastic) 
cross section, with <jn the integrated cross section for events with N final state particles 
(here to be taken as identical for simplicity of notation) . The ratio Pn = &n / Ctot gives the 
multiplicity distribution in full phase-space. 

The densities jn and p q are very different objects: jn(Pi, ■ ■ ■ ,PN)d 3 Pi • • • d 3 p N is pro- 
portional to the probability that in an event with exactly N particles we find simulta- 
neously the first particle in a box of size d 3 p 1 centered on p x , the second in a box at 
p 2 , ■ • ■ , and the N-th at p N . Thus, jm characterizes the statistical properties of samples 
built exclusively from events with exactly TV particles (exclusive samples). By contrast, 
pqipi, ■ ■ ■ 1 p q )d 3 p 1 ■ ■ -d 3 p q is the average number of unordered gr-tuplcs of particles with 
momenta simultaneously within infinitesimal boxes centered on Pj,..., p q per event (no 
matter what ./V is). Therefore p q characterizes samples which include all events (inclusive 
samples). While various other names are in use, we follow Ref. in referring to j'at as a 
Janossy density and to p q as a factorial moment density. 

Both functions are symmetrized with respect to permutations of their arguments. In 
fact, the labels "first" particle, "second" particle,. .. are arbitrary and carry no physical 
information. Therefore it is customary to count all TV! (q\) permutations of labels as sepa- 
rate, independent events (g-tuples). This symmetrization has nothing to do with quantum 
mechanical "indistinguishability" of particles, but reflects a mere convention designed to 
simplify the resulting formalism. 



1 d 3N a cxcl 
otot <PPi d 3 p 2 ■ ■ ■ d 3 p N ' 
1 rfXrow, (2) 



otot d 3 Pi d 3 p 2 ■ ■ ■ d 3 p 
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While jisr /NlPpf, normalized to unity, represents a conventional joint probability density 
of N random variables p i , it is a mistake to regard p q as such. It is constructed from inclusive 
samples, where the number of random variables (N) is a again a random variable. 

In fact, integration over an arbitray region of phase-space fl gives the q-th factorial 
moment of the random multiplicity n in this domain: 



£,(«)= / p q (p 1 ,...,p q )d*p 1 ...d<p q = <nM) . (3) 
Jo \ / si 

We use the common notation rv- q ' = n(n— 1) • ■ ■ (n—q+1) for factorials. 

All the above definitions are given in terms of cross sections. However, all experimental 
multiplicity and correlation measurements can be viewed as particular counting procedures 
of particles or particle-tuples in certain domains. Thus, we pose the questions: How do we 
have to count in order to obtain a certain type of information and, moreover, how do we 
count most efficiently? 

The appropriate tool to tackle such questions is the theory of point processes ||, |). 
Consider a sample of iV cv events, each labeled by an index a = 1, . . . , N cv . For greater 
generality, we subsequently denote the "positions" of the particles (=points) by X°, (i = 
1,...,N), where X can refer to any set of coordinates. Frequently used examples are 
rapidity y, rapidity-azimuth (y, $) or even a combination of continuous and discrete variables 
such as (p, s), with s labeling the charge, spin or species of the particleQ 

The density of such points at x in one particular event a is most conveniently represented 
by the "random Dirac comb" 

N 

pt(x) = J2S(x-Xl), (4) 

ii=l 

which, when integrated over a certain domain f2, just gives the number of particles n in that 
domain. More generally, the simultaneous behavior ( = correlation) of q of these particles 
in that event is represented by the restricted tensor product of Dirac combs 

N 

p a q {x x ,...,x q ) = Yl S( Xl -Xl)S(x 2 -Xl)---S(x q -Xl) 7 (5) 

= 1 

which just acts as a q-tuple counter in event a. Note that it doesn't make sense to count 
the same particle more than once (hence the restriction on the indices in the multiple sum) . 

Meaningful results are extracted by averaging over the inclusive event sample, to yield 
the (/-tuple density 

JV CV 

Pq = ( Pq ) =N~ 1 Y i p a q , (6) 

o=l 

which is nothing but the counting prescription for the factorial moment densities eq. (^) . 

A point process is fully determined by the knowledge of either all p q , all j'jv or, most 
conveniently, by its generating functional. The latter is defined by 



Z[\(x)] = ^cxp (^J p{x) log [1 + X(x)] dx 

= ^ — J Pqjxi, X q )\(xi) ■ ■ ■ X(x q ) dx x ■ ■ ■ dx q 



(7) 
(8) 



3 In this case all following integrations contain implicitly sums over s. 
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Once we know Z[A(a;)], the p q as well as the jn can be obtained via functional derivatives 
with respect to the test function A: 



p q {x x ,...,x q ) = 
j N (xi, . . . ,x N ) = 



6«Z[\(x)] 



8X(xi) ■ ■ ■ 5\(x q ) 
S q Z[X(x) - 1] 



S\(xi) ■ ■ ■ S\(xn) 



(9) 
(10) 

A=0 



In order to understand what we mean by "correlations" we first discuss the meaning of 
statistical independence in point processes. The latter is defined by full factorization of the 
factorial moment densities p q (xi, . . . ,x q ) — pi(xi)pi(x 2 ) ■ ■ ■ pi(x q ) for all q. In this case 
we can readily sum up the series @ to obtain 

Z[X(x)] = exp Pl (x)X(x)dx) , (11) 

which is the generating functional of the so-called Poisson process. In this process, the 
multiplicity of points in any arbitrary domain f2 follows a Poisson distribution. This process 
plays a similar central role for point processes as the Gaussian does for the statistics of 
continuous random variables. 

Genuine correlations among points are then quantifiable by deviations from the Poisson 
process. Consider the family of functions, called cumulant densities , obtained by functional 
derivatives of log Z[X] : 



SX(xi) ■ ■ ■ 5X(x q ) 



(12) 

A=0 



For the Poisson process (|Tl]) we see that C\ — pi and all higher cumulant densities vanish. 
In this sense, nonvanishing C q with (q > 2) quantify genuine correlations, i.e. deviations 
from the Poisson process. 

It is quite possible that only a few orders of cumulants are nonzero. For example in heavy 
ion reactions cumulants of third and higher orders are highly suppressed. Such processes can 
be modeled as poisson processes of "clusters" , each cluster decaying into one or 2 daughter 
particles. 

The generating functional Z[X] is a convenient bookkeeping device for establishing all 
kinds of relations among j'jv , p q and C q , for example 

C 2 (x 1 ,x 2 ) = p2(x!,x 2 ) - pi(x 1 )p 1 (x 2 ) , (13) 

C 3 (x 1 ,X 2 ,X 3 ) = p 3 (xi,X 2 ,X S ) - Pl(x 1 )p 2 (x 2 ,X3) - Pl(x 2 )p2(X3,X 1 ) 

- p 1 {x 3 )p 2 (x 1 ,x 2 ) + 2p 1 (x 1 )p 1 (x 2 )p 1 (x 3 ) etc. (14) 

Notice the subtraction of various products of lower order factorial moment densities in the 
construction of cumulants, sometimes called "removal of combinatorial background". It is 
this property that makes cumulants a favourable tool for discriminative experimental anlysis, 
but at the same time more difficult to measure. 



3 Correlation integrals 

If we had experimental knowledge of correlations in terms of either j^, p q or C q to all orders, 
the multiparticle production process would be completely determined. This can never, of 
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course, be achieved in practice. However, while higher-order correlation functions can never 
be sampled fully differentially, one can still try to extract as much information as possible 
from a give data sample by measuring integrals over various domains f2 as in eq. (^). 

Conventional measurements of correlations proceed first to discretize the continuous 
variable X ("binning the data") and then to find £ q by averaging over all events the counts 
rim m every bin 

c nv = ( E ( 15 ) 

\bins m I 

By contrast, so-called correlation integrals rely on distances between pairs of points = 
\X ix — X i2 \ rather than counting particles in predefined bins |). More generally, every 
correlation integral of order q assigns a "size" to every possible g-tuple of particles. The way 
this assignment is done distinguishes the different versions of correlation integrals. Finally, 
they count the number of q-tuples with a given size e ( "differential forms" ) or the ones 
smaller than a given size ("integral forms"). For a large data sample, this corresponds to 
an integration over p q in specific domains f2(e). 

For the Star integral ||, the domain Q, is given by the collection of N spheres of radius 
e, each centered at one of the N particles in the event. For a given event a the number of 
particles ("sphere count") within one of these spheres is, not counting the particle at the 
center X^ , 

N 

a = n(Xi 17 e)=^2&{e-X lli2 ), i 2 7^ «i , (16) 

where a is an "ultra short" notation needed for some lengthy formulae below. With this 
elementary counter the factorial moment of order q is simply obtained by 

*f M = (j^niX^e)^ = ^> [9 ~ 1] ) • (17) 

One can show |(| that the above counting prescription corresponds to integrating eq. (^) 
using for Q a particular "Star" domain implemented via theta functions = 6(e — \x\ — 
Xj\), restricting all q—1 coordinates Xj to within a distance e of x±: 

£ 9 Star (e) - J p q (x!,...,x q ) 612613 . . . 0i fl dxi . . . dx q . (18) 

The superiority of correlation integrals in general over the conventional Bialas-Peschanski 
factorial moments is discussed and demonstrated in || . The further advantage of the 
particular "Star" domain lies in the fact that eq. ( [l7j) requires typically N CV *N 2 computation 
steps for any order q, whereas other types of correlation integral have order _/V cv * N q 
complexity, which quickly becomes unmanageable for higher orders. 

In order to eliminate, among other things, the overall total cross section, it has become 
customary in high energy physics to measure normalized factorial moments Q . The denom- 
inator used for such normalization should be made up of the uncorrelated background, p\ . 
While it can be implemented in a number of ways, we prefer the "vertical" normalization, 
in which p\ is integrated over exactly the same domain Q as the inclusive density p q in the 
numerator. Thus for the Star integral, the normalized moment is 

F star (e) ee = I Es^h ■ ■ ■ ' ggj Q i2Qi3 ■ ■ ■ gig dxi ■ ■ ■ dx q 

9 e orm J pi(x 1 )...p 1 (x q )Q 12 e 13 ...Q lq dx 1 ...dx q ' 
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We have shown || that the denominator ££ orm is given by the following double event average: 
with X& = | JT" — X h i2 1 measuring the distance between two particles taken from different 
events a and b, and the "ultra short" notation 

b = h b (Xl,e)=J2®(z-Kl) (20) 

we get 

£»°™(e) = (^£i»°™(i)J = (^J ■ (21) 

Note that the outer event average and sum over i\ are taken over the center particle taken 
from event a, each of which is used as the center of sphere counts hh{Xf , e) taken over other 
events b ^ a in the inner event average. We thus see the natural emergence of the heuristic 
procedure of normalization known as "event mixing" ^j. This counting prescription 
has complexity N 2 V * iV 2 — unmangeable for large event samples. However, in practical 
applications it is sufficient to reduce the inner average over 6-events to a small subsample 
b = a — 1, a — 2, . . . , a — A, consisting only of A — 1 events and requiring only A * N cv * N 2 
computation steps. This procedure we call "reduced" event mixing. 

Another essential advantage of the Star domain is the fact that on top of the F^ tar we 
get the integrals of cumulants (almost) for free, once the counts a and b per particle i are 
perfomed. Integrating the C q over the Star integral domain, we define normalized (factorial) 
cumulants #f tar (e) = / g (e)/^ orm (e), with 

f q (e) = J C q (xi, ... ,x q ) 612613 ... Qi q dxi .. .dx q . (22) 

The latter can be written entirely in terms of the sphere counts a and b introduced previously! 
Defining for convenience the "z-particle cumulant" f q (i) so that /?(*)) = fi 1 we ^ n< ^ 
I 

m = a -(b), (23) 
/ 3 (i) = a [2l -(6[ 2 ])-2a(6) + 2(6) 2 , etc. (24) 

While computationally more expensive than the Star integral by orders of magnitude, 
other types of integration domains O are also useful, in particular when a comparison of 
theoretical models with data dictate a specific choice of variables and integration domain. 
This is typically the case in measurements of Bose-Einstein correlations, where a preferred 
variable is the 4-momentum difference q^j — [{p i — p^) 2 — [E^ — Ej) 2 ] 1 / 2 . 

We then may study differential forms of integrations over p q or C q in a fully symmetrical 
manner (called GHP topology) such as 

C q (e) = J C q ( Pl , ..., Pq )S(e- ]T 4) d 3 Pl ■ ■ ■ d% . (25) 

i<j = l 

Gther g-tuple size prescriptions (used below) are obtained by changing the argument of the 
5-function appropriately, e.g. S(e - Yn<j=i Qij) 01 ^( e ~ m ax(gi 2 , . . . , q q -\, q )). 

The corresponding counting prescriptions are conveniently written in terms of the generic 
g-tuple counter 

eie 2 - e 9 , s _ J 1 if the "size" of the q-tuple is within e + Se , . 

hi a .-i q {£) = \ o otherwise, ( ' 
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where the g-tuple is composed of particle i\ taken from event ex, particle i 2 taken from a 
different event 62, etc., and the "size" is evaluated, e.g., according to one of above quoted 
prescriptions. 

The counting algorithms for p q (e) and the first C g (e) are 
P«{e)6e = ( E «-V e )) (27) 

= (ew)-((e#w)) ( 28 ) 
= ( E ^"(4 - 3 ( (E ^)) ) + 2 (((E$( e )) ) ) -( 29 ) 

Wi#* / \ Wj.k /&/„ \ \ W,* / c / 5 / a 



Integrals ( |25| ) over uncorrelated tensor products of pi, needed for normalization, are 
sampled in similar ways : 

Pl ® Pl ®---® Pl {e)8e =((■••( E C?( e )M-j ■ (3°) 

Note that multiple event averages, e.g. ((. . .) )& = J2 a ^b /Nev{N ev — 1), always run over 
unequal events to avoid noticeable sampling biases. Again, inner event averages can run 
over a small fraction of the full sample to keep computing times in manageable ranges M . 



4 Applications to Bose-Einstein correlation measure- 
ments 

As an application, we discuss a recent test [pi of quantum statistical (QS) models for Bose- 
Einstein correlations among identical pions [g[ [l(| |ll| . The latter postulate a specific form 
of the generating functional (0) with C 2 as freely parametrizable function. Once C2 is 
determined from experiment, all higher cumulant densities are fully specified with no further 
adjustable parameter. One therefore can test experimentally the validity of the postulated 
generating functional by comparing higher order cumulants from experiment with the model 
predictions, without having to rely on a particular parametrization of C2. 

When relative phases are neglected, the second and third (reduced) "QS cumulants" of 
interest arc Q 

k 2 = -^- = 2p(l-p)d 12 +p 2 d 2 12 , (31) 
C 

k 3 = — - — - — = 2p 2 (l ~p)[di 2 d23 + d 23 d 3 i + d 3 idi 2 ] + 2p 3 di 2 d 23 d 3 i , (32) 
PiOpiOpi 

where p\®p\ = Pi{P\) P i{p 2 ) and the dij=d(qij) are functions of the 4-momentum differences 

While in principle calculable from a given density matrix, the functions dij are usually 
parametrized in a plausible and/or convenient way, such as Gaussian (d^ = exp(— r 2 qf ■ )), 
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Figure 1: Third-order differential cumulants AK^{e) integrated with GHP max (a) and 
GHP sum (b) topologies ||. Filled circles represent UAl minimum-bias data, while open 
symbols show predictions for AK% from QS theory and parameter values obtained from fits 
to AK 2 - Open triangles are predictions based on the QS power-law parametrization; open 
squares are QS exponential predictions. 



exponential (dij = exp(— rqij)) and power-law parametrizations (dij = q^ a ). Since the 
Gaussian does not give viable fits, only the latter two parametrizations are used below. 

In Figure 1, we show the results of a preliminary^ comparison of the normalized third 
order cumulant AK% ( |29| ) obtained from UAl minimum-bias data (pp-reactions at y/s = 
630 GeV) with QS-predictions (|32|). Besides the GHP-sum topology used in (b), we show 
in (a) a separate analysis using the "GHP max" topology which bins triplets according 
to the largest of the three momentum differences, max(qi2, 923, Q3i)- Fit parameter values 
used for the respective power law and exponential parametrizations were taken from the 
QS fit ( |3l| ) to AK 2 obtained from the same data sample. All theoretical points shown are 
determined only up to an additive constant, so that the curves may be shifted up and down. 
It is clear, though, that the shape of third-order cumulant data measured differs appreciably 
from that predicted by the QS formulae and parameter values from Ai^- This conclusion 
holds independently of the topology used and of the functional form taken for d. 

The results of this analysis may appear, at first sight, to contradict the conclusion [ fl0| , 
based on an earlier UAl paper |12| , that QS theory was compatible with higher-order fac- 
torial moments. The apparent discrepancy is explained by pointing out that the recent 
improvement of measurement techniques have permitted the present direct measurements 

4 This preliminary analysis is based on like-sign particles in a restricted phase space region with good 
detector acceptance. The analysis is currently being extended to an enlarged phase space region. 
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of cumulants, which are considerably more sensitive than moments. The latter are dom- 
inated numerically by the combinatorial background of lower order correlations and thus 
contain mostly redundant information. 

5 Final remarks 

The use of correlation integrals permits much more accurate measurements and hence will 
likely reveal more detailed structure of the underlying dynamics. In particular, cumulants are 
promising and sensitive tools to obtain refined insight into various production mechanisms. 

Greater accuracy requires, however, that possible biases be understood on a deeper level 
than before. One such bias arising generally in the measurement of correlations is due to 
the finite size of event samples and can have quite noticeable size. In practice, one has to 
use unbiased estimators, which correct for this effect (cf. 0). 

Another caveat is the comparison of experimental correlation data with theoretical pre- 
dictions, when the latter are given in terms of differentially normalized (reduced) correlation 
functions such as ( |3l| ) and (|32|). Experimentally, one can never measure fully differential 
ratios; rather, the numerator and denominator are averaged over some bin of finite size f2 
(however small) before the ratio is taken. The discrepancy to fully differential ratios can be 
quite substantial. A procedure to overcome this problem amounts to a Monte Carlo integra- 
tion of a theoretical correlation function sampled according to the experimental uncorrelated 
one-particle distribution; this and other details are explained in ||. 
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